Additive noise and channel distortion-robust parametrization tool - performance evaluation on Aurora 2 & 3
نویسندگان
چکیده
In this paper a HTK-compatible robust speech parametrization tool CtuCopy is presented. This tool allows for the usage of several additive noise suppression preprocessing techniques, nonlinear spectrum transformation, RASTA-like filtration, and direct final feature computation. The tool is general, it is easily extendible, and it may be also used for speech enhancement purposes. In the second part, parametrizations combining the extended spectral subtraction for additive noise suppression and LDA RASTA-like filtration for channel-distortion elimination with final computation of PLP cepstral coefficients are examined and evaluated on Aurora 2 & 3 and Czech SpeechDat corpora. This comparison shows specific algorithm features and the differences in their behavior on above mentioned databases. PLP cepstral coefficients with both extended spectral subtraction and LDA RASTA-like filtration seem to be good choice for noise robust parametrization.
منابع مشابه
Performance Evaluation of CMN for Mel-LPC based Speech Recognition in Different Noisy Environments
This study is intended to develop a noise robust distributed speech recognizer for real-world applications by employing Cepstral Mean Normalization (CMN) for robust feature extraction. The main focus of the work is to cope with different noisy environments. To realize this objective, Mel-LP based speech analysis has been used in speech coding on the linear frequency scale by applying a first-or...
متن کاملTowards High-Accuracy Low-Cost Noisy Robust Speech Recognition Exploiting Structured Model
It is well known that the distorted speech can be considered generated from the clean speech with the additive noise and the convolutive channel as In this paper, we present our recent study on using this structured model of physical distortion for robust automatic speech recognition. Three methods are introduced for joint compensation of additive and convolutive distortions (JAC), with diffe...
متن کاملPerformance Evaluation of Blind Equalization for Mel-LPC based Speech Recognition under Different Noisy Conditions
This study is aimed to develop a noise robust distributed speech recognizer (DSR) for real-world applications by employing Blind Equalization (BEQ) for robust feature extraction. The main focus of the work is to cope with different noisy environments in recognition phase. To realize this objective, Mel-LP based speech analysis has been used in speech coding on the linear frequency scale by appl...
متن کاملA unified framework of HMM adaptation with joint compensation of additive and convolutive distortions
In this paper, we present our recent development of a model-domain environment-robust adaptation algorithm, which demonstrates high performance in the standard Aurora 2 speech recognition task. The algorithm consists of two main steps. First, the noise and channel parameters are estimated using multi-sources of information including a nonlinear environment distortion model in the cepstral domai...
متن کاملChannel Compensation in the Generalised Vector Taylor Series Approach to Robust ASR
Vector Taylor Series (VTS) is a powerful technique for robust ASR but, in its standard form, it can only be applied to log-filter bank and MFCC features. In earlier work, we presented a generalised VTS (gVTS) that extends the applicability of VTS to front-ends which employ a power transformation non-linearity. gVTS was shown to provide performance improvements in both clean and additive noise c...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2003